Dereverberation by Using Time-Variant Nature of Speech Production System
Similar resources
Dereverberation by Using Time-Variant Nature of Speech Production System
This paper addresses the problem of blind speech dereverberation by inverse filtering of a room acoustic system. Since a speech signal can be modeled as being generated by a speech production system driven by an innovations process, a reverberant signal is the output of a composite system consisting of the speech production and room acoustic systems. Therefore, we need to extract only the part ...
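The signal model described in this abstract can be illustrated with a minimal sketch. It shows only the composite system (speech production followed by room acoustics), not the paper's dereverberation method. The white Gaussian innovations, the AR(2) coefficients, the decaying random room impulse response, and all function names are assumptions chosen for illustration.

```python
import numpy as np

def ar_filter(e, a):
    """All-pole (AR) filter: s[n] = e[n] + sum_k a[k] * s[n-1-k]."""
    s = np.zeros(len(e))
    for n in range(len(e)):
        s[n] = e[n]
        for k, ak in enumerate(a):
            if n - 1 - k >= 0:
                s[n] += ak * s[n - 1 - k]
    return s

def simulate_reverberant_speech(n_samples=2000, seed=0):
    rng = np.random.default_rng(seed)
    # Innovations process: white Gaussian noise (assumption).
    e = rng.standard_normal(n_samples)
    # Speech production system: stable AR(2) filter, hypothetical coefficients
    # (poles have magnitude sqrt(0.6), which is inside the unit circle).
    a = [1.3, -0.6]
    s = ar_filter(e, a)
    # Room acoustic system: exponentially decaying random FIR response (assumption).
    h = rng.standard_normal(64) * np.exp(-np.arange(64) / 16.0)
    h[0] = 1.0
    # Reverberant observation: speech convolved with the room response.
    x = np.convolve(s, h)[:n_samples]
    return e, a, s, h, x

def composite_response(a, h, n_samples):
    """Impulse response of speech production followed by the room."""
    impulse = np.zeros(n_samples)
    impulse[0] = 1.0
    g = ar_filter(impulse, a)  # impulse response of the AR system
    return np.convolve(g, h)[:n_samples]

e, a, s, h, x = simulate_reverberant_speech()
c = composite_response(a, h, len(e))
# Driving the composite system with the innovations reproduces x.
x_composite = np.convolve(e, c)[:len(e)]
print(np.allclose(x, x_composite))
```

Because the observation depends only on the composite response, a blind method that sees only `x` cannot tell the two subsystems apart without extra assumptions, which is why the abstract stresses extracting only part of the composite system.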
Speech Dereverberation
Reverberation makes speech sound distant and spectrally distorted, and it can also reduce intelligibility. Dereverberation is therefore an important speech enhancement process for hands-free terminals. It is a blind problem and remains unsolved. This paper reviews existing approaches and discusses current work on this topic in two categories, one based on pr...
Speech Denoising and Dereverberation Using Probabilistic Models
This paper presents a unified probabilistic framework for denoising and dereverberation of speech signals. The framework transforms the denoising and dereverberation problems into Bayes-optimal signal estimation. The key idea is to use a strong speech model that is pre-trained on a large data set of clean speech. Computational efficiency is achieved by using variational EM, working in the frequ...
Speech dereverberation using long short-term memory
Recently, neural networks have been used for not only phone recognition but also denoising and dereverberation. However, the conventional denoising deep autoencoder (DAE) based on the feed-forward structure is not capable of handling very long speech frames of reverberation. LSTM can be effectively trained to reduce the average error between the enhanced signal and the original clean signal by ...
Speech Dereverberation Using Fully Convolutional Networks
Speech dereverberation using a single microphone is addressed in this paper. Motivated by the recent success of fully convolutional networks (FCNs) in many image processing applications, we investigate their applicability to enhancing the speech signal represented by short-time Fourier transform (STFT) images. We present two variations: a “U-Net” which is an encoder-decoder network with skip co...
Journal
Journal title: EURASIP Journal on Advances in Signal Processing
Year: 2007
ISSN: 1687-6180
DOI: 10.1155/2007/65698